Identifying important concepts from medical documents

Authors

  • Quanzhi Li
  • Yi-fang Brook Wu
Abstract

Automated medical concept recognition is important for medical informatics tasks such as medical document retrieval and text mining research. In this paper, we present a software tool called the keyphrase identification program (KIP) for identifying topical concepts from medical documents. KIP combines two functions: noun phrase extraction and keyphrase identification. The former automatically extracts noun phrases from medical literature as keyphrase candidates. The latter assigns a weight to each extracted noun phrase based on how important it is to the document and how specific it is to the medical domain. The experimental results show that our noun phrase extractor is effective in identifying noun phrases from medical documents, and that the keyphrase extractor is effective in identifying important medical conceptual terms. Both performed better than the systems they were compared to.
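The two-step idea in the abstract (extract candidate phrases, then weight them by importance to the document and by domain specificity) can be illustrated with a toy Python sketch. This is not KIP and does not reproduce the authors' method: candidate phrases are approximated by splitting on stopwords and punctuation rather than by real noun phrase extraction, and the names `STOPWORDS`, `DOMAIN_WEIGHT`, `candidate_phrases`, and `score_phrases`, along with all weight values, are hypothetical choices made only for illustration.

```python
import re
from collections import Counter

# Hypothetical stopword list. A real extractor would rely on part-of-speech
# tagging to find noun phrases; splitting on stopwords is only a stand-in.
STOPWORDS = {"the", "a", "an", "of", "in", "and", "is", "are", "for", "to", "may"}

# Hypothetical per-word domain-specificity weights in [0, 1] (invented values).
DOMAIN_WEIGHT = {"insulin": 0.9, "resistance": 0.7, "diabetes": 0.9, "therapy": 0.5}


def candidate_phrases(text):
    """Return runs of consecutive non-stopwords as candidate phrases."""
    phrases = []
    for fragment in re.split(r"[.,;:!?()]", text.lower()):
        current = []
        for word in re.findall(r"[a-z0-9-]+", fragment):
            if word in STOPWORDS:
                if current:
                    phrases.append(" ".join(current))
                    current = []
            else:
                current.append(word)
        if current:
            phrases.append(" ".join(current))
    return phrases


def score_phrases(text, domain_weight, default_weight=0.05):
    """Score each candidate as (frequency in document) * (mean word weight)."""
    counts = Counter(candidate_phrases(text))
    scores = {}
    for phrase, freq in counts.items():
        words = phrase.split()
        specificity = sum(domain_weight.get(w, default_weight) for w in words) / len(words)
        scores[phrase] = freq * specificity
    # Highest score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    sample = ("Insulin resistance is common in diabetes. "
              "Insulin resistance may reduce the effect of therapy.")
    for phrase, score in score_phrases(sample, DOMAIN_WEIGHT):
        print(f"{score:.2f}  {phrase}")
```

In this sketch, frequency stands in for importance to the document and the averaged word weight stands in for domain specificity. How KIP actually computes either quantity is not stated in the abstract.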

Similar resources

An Intelligent Bio-medical Document Retrieval Using Concept Map Identification

Document retrieval is one of the fast-growing and complex research areas in the field of information retrieval. Concept-based information retrieval is more beneficial than keyword-based access. The most important advantage is the ability to make use of semantic relationships among concepts in finding relevant documents. Removal of irrelevant documents by identifying conceptual mismatches is ...

Document clustering based on ontology and a fuzzy approach

Data mining, also known as knowledge discovery in databases, is the process of discovering unknown knowledge from a large amount of data. Text mining applies data mining techniques to extract knowledge from unstructured text. Text clustering, one of the important techniques of text mining, is the unsupervised classification of similar documents into different groups. The most important step...

Identifying relevant medical reports from an assorted report collection using the multinomial naïve Bayes classifier and the UMLS

A patient's electronic medical record contains a large number of medical reports and imaging studies. Identifying the relevant information in order to make a diagnosis can be a time consuming process that can easily overwhelm the physician. Summarizing key clinical information for physicians evaluating brain tumor patients is an ongoing research project at our institution. Notably, identifying ...

Re-reading immovable inscriptional documents in the World Heritage site of the Tabriz Historic Bazaar Complex

Immovable inscriptions are considered among the most important works and historical documents in the cultural assets of our country. They were installed on selected parts of historical buildings and outstanding monuments and were always noticeable. The role of inscriptions as the basic and effective tools is important in terms of manifesting and implication of educational and ed...

Extracting Conceptual Terms from Medical Documents

Automated biomedical concept recognition is important for biomedical document retrieval and text mining research. In this paper, we describe a two-step concept extraction technique for documents in the biomedical domain. Step one is noun phrase extraction, which automatically extracts noun phrases from medical documents. Extracted noun phrases are used as concept term candidates which beco...

Journal title:
  • Journal of biomedical informatics

Volume 39, Issue 6

Publication date: 2006